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NM_173674 2010 bp mRNA linear PRI 05 -OCT- 2 003 

Homo sapiens discoidin, CUB and LCCL domain containing 1 (DCBLDl) , 
mRNA. 
NM_173674 

NM_173674.1 GI : 27735142 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 
1 (bases 1 to 2010) 

Strausberg, R.L. , Feingold, E . A . , Grouse, L.H., Derge,J.G., 
Klausner, R.D. , Collins , F . S . , Wagner, L . , Shenmen , C . M . , Schuler, G . D . , 
, Zeeberg,B., Buetow,K.H., Schaef er , C - F . , Bhat,N.K. 7 . . 
Jordan, H. , Moore, T., Max,S.I., Wang, J., Hsieh,F., 
Maru'sina,K. , Farmer, A. A. , Rubin , G.M., Hong,L., 
t Soares,M.B., Bonaldo,M.F . , Casavant , T . L . , 
, Brownstein, M. J. , Usdin,T.B., Toshiyuki , S . , 
Prange,C., Raha,S.S., Loquellano, N . A . , Peters, G. J., 



Altschul, S.F. 
Hopkins , R . F . , 
Diatchenko , L . , 
Stapleton, M. , 
Scheetz , T . E . , 
Carninci, P. , 

Abramson , R . D . , Mullahy , S . J . , Bosak,S.A., McEwan,P.J., 
McKernan, K. J. , Malek,J.A., Gunaratne, P . H . , Richards, S., 
Worley,K.C, Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., 
Villalon,D.K. , Muzny, D . M . , Sodergren, E . J . , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E . , Ketteman,M., Madan,A., Rodrigues , S . , 
Sanchez, A., Whiting, M. , Madan,A. , Young , A . C . , Shevchenko, Y. , 
Bouf fard,G.G. , Blakesley, R. W. , Touchman , J . W . , Green, E.D., 
Dickson, M . C . , Rodriguez , A. C . , Grimwood,J., Schmutz,J., Myers,R.M., 
Butterf ield, Y. S . , Krzywinski , M . I . , Skalska,U., Smailus, D. E. , 
Schnerch,A. , Schein,J.E., Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci . U.S.A. 99 (26), 16899-16903 (2002) 

22388257 

12477932 

PROVISIONAL REFSEQ : This record has not yet been subject to final 
NCBI review. The reference sequence was derived from BC035671 . 1 . 

Location/ Qualifiers 

1. .2010 

/organism= " Homo sapiens " 
/mol_type= "mRNA" 
/db_xref = " taxon : 9 60 6 " 
/chromosome=" 6 " 
/map="6q22 .31" \ 
1..2010 

/gene= "DCBLDl" 

/note= " synonyms : MGC46341, dJ94G16.1" 
/db_xref="GeneID: 285761" 
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/ db„xre f = M Locus ID : 285761 fl 
CDS 126.. 1745 

/gene="DCBLDl" 

/note= n go_process: cell adhesion [goid 0007155] [evidence 
IEA] " 

/codon_start=l 

/product^" hypothetical protein MGC46341" 
/protein_id= " NP_775945 . 1 " 
/db_xref="GI: 27735143 " 
/db_xref="GeneID: 285761" 
/db_xre f = " Locus ID : 285761 " 

/ 1 rans lat i on= " MVPGARGGGALARAAGRGLLALLLAVSAPLRLQAEELGDGCGHL 
VT YQD S GTMT S KNYPGT Y PNHTVC E KT I TVPKGKRD I LRLGDLD I E S QTCAS D YLLF T 
SSSDQYGPYCGSMTVPKELLLNTSEVTVRFESGSHISGRGFLLTYASSDHPDLITCLE 
RASHYLKTEYSKFCPAGCRDVAGDISGNMVDGYRDTSLLCKAAIHAGIIADELGGQIS 
VLQRKGISRYEGILANGVLSRDGSLSDKRFLFTSNGCSRSLSFEPDGQIRASSSWQSV 
NESGDQVHWSPGQARLQDQGPSWASGDSSNNHKPREWLEIDLGEKKKITGIRTTGSTQ 
SNFNF YVKS F VMNFKNNNS KWKT YKG I VNNEE KVFQGNSNF RD P VQNNF I P P I VARYV 
RWPQTWHQRIALKVELIGCQITQGNDSLVWRKTSQSTSVSTKKEDETITRPIPSEET 
STG INITTVA I PLVLLWLVFAGMG I FAAFRKKKKKGS P YGS AEAQKTDCWKQI KYPF 
ARHQSAEFTI S YDNEKEMTQKLDLITSDMAG " 
misc_feature 246.. 572 

/gene="DCBLDl" 

/note="CUB; Region: Domain first found in Clr, Cls, uEGF, 
and bone morphogenetic protein. This domain is found 
mostly among developmental ly-regulated proteins. 
Spermadhesins contain only this domain" 
/db_xref=" CDD : smart00042 " 
misc_feature 591.. 869 

/gene="DCBLDl n 

/note="LCCL; Region: LCCL domain" 
/ db_xr e f = " CDD : p f amO 3 815 " 
misc_feature 912 . . 1352 

/gene=" DCBLD1 " 

/note="F5_F8__type_C; Region: F5/8 type C domain. This 
domain is also known as the discoidin (DS) domain family. 
The bacterial examples are not yet included in the SEED 
alignment and are only found with low scores" 
/ db_xref = " CDD : pfam00754 " 

BASE COUNT 577 a 457 c 512 g 464 t 

ORIGIN 

1 ggcagcgact gcgccccgtc ccggcgccgc gctcgtccgc agaggaggcg gcccggcccg 
61 ggcagctgcg gctcgggatc cgtcgagggg aggccgagct tgccaagctg gcgcccagcg 
121 gggtcatggt gcccggcgcc cgcggcggcg gcgcactggc gcgggctgcc gggcggggcc 
181 tcctggcttt gctgctcgcg gtctccgccc cgctccggct gcaggcggag gagctgggtg 
241 atggctgtgg acacctagtg acttatcagg atagtggcac aatgacatct aagaattatc 
3 01 ccgggaccta ccccaatcac actgtttgcg aaaagacaat tacagtacca aaggggaaaa 
361 gactgattct gaggttggga gatttggata tcgaatccca gacctgtgct tctgactatc 
421 ttctcttcac cagctcttca gatcaatatg gtccatactg tggaagtatg actgttccca 
481 aagaactctt gttgaacaca agtgaagtaa ccgtccgctt tgagagtgga. tcccacattt 
541 ctggccgggg ttttttgctg acctatgcga gcagcgacca tccagattta ataacatgtt 
601 tggaacgagc tagccattat ttgaagacag aatacagcaa attctgccca gctggttgta 
6 61 gagacgtagc aggagacatt tctgggaata tggtagatgg atatagagat acctctttat 
721 tgtgcaaagc tgccatccat gcaggaataa ttgctgatga actaggtggc cagatcagtg 
781 tgcttcagcg caaagggatc agtcgatatg aagggattct ggccaatggt gttctttcga 
841 gggatggttc cctgtcagac aagcgatttc tgtttacctc caatggttgc agcagatcct 
901 tgagttttga acctgacggg caaatcagag 'cttcttcctc atggcagtcg gtcaatgaga 
961 gtggagacca agttcactgg tctcctggcc aagcccgact tcaggaccaa ggcccatcat 
1021 gggcttcggg cgacagtagc aacaaccaca aaccacgaga gtggctggag atcgatttgg 
1081 gggagaaaaa gaaaataaca ggaattagga ccacaggatc tacacagtcg aacttcaact 
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1141 tttatgttaa gagttttgtg atgaacttca 
1201 aaggaattgt gaataatgaa gaaaaggtgt 
1261 tgcaaaacaa tttcatccct cccatcgtgg 
1321 ggcaccagag gatagccttg aaggtggagc 
1381 attcattggt gtggcgcaag acaagtcaaa 
1441 agacaatcac aaggcccatc ccctcggaag 
1501 tggctattcc attggtgctc cttgttgtcc 
1561 cctttagaaa gaagaagaag aaaggaagtc 
1621 actgttggaa gcagattaaa tatccctttg 
1681 gctatgataa tgagaaggag atgacacaaa 
1741 gttaactccg ttgactgcca aaatagcatc 
1801 aggttgcccc ggatggatct cagagatgag 
18 6,1 acaacaacaa agggcagtaa attaaagtac 
1921 agagataaaa tattttctta aaaatatatt 
1981 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



aaaacaataa ttctaagtgg aagacctata 
ttcagggtaa ctctaacttt cgggacccag 
ccagatatgt gcgggttgtc ccccagacat 
tcattggttg ccagattaca caaggtaatg 
gcaccagtgt ttcaactaag aaagaagatg 
aaacatccac aggaataaac attacaacgg 
tggtgtttgc tggaatgggg atctttgcag 
cgtatggatc agcagaggct cagaaaacag 
ccagacatca gtcagctgag tttaccatca 
agttagatct catcacaagt gatatggcag 
cccaacgtgc agccctccgc atctatcagc 
gatcggaaca ccatgttctt tcccacccta 
tctttgtaag gtacagttac cgattaatct 
tcattaaaca cctatgctgt ctctataaaa 
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FASTA searches a protein or DNA sequence data bank 
version 3.3t05 March 30, 2000 
Please cite: 

W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 

/tmp/fastaDAAE.aqrX: 1464 nt 

>LEX 71 Seq id no:l 

vs /tmp/fastaEAAF.aqrX library- 
searching /tmp/f astaEAAF .aqrX library 

2010 residues in 1 sequences 

FASTA (3.04 January 2000) function [optimized, +5/-4 matrix (5:-4)] ktup : 6 

join: 63, opt: 48, gap-pen: -16/ -4, width: 16 

Scan time: 0.116 
The best "scores are: opt 
gij27735142|ref | NM_173 674 . 1 | Homo sapiens disc (2010) [f] 7311 
gi j 27735142 jref | NM_173674 . 1 | Homo sapiens disc (2010) [r] 67 

»gi | 27735142 | ref |NM_173674 .1 1 Homo sapiens discoidin, C (2010 nt) 
initn: 7311 initl: 7311 opt: 7311 

99.932% identity in 1464 nt overlap (1-1464:282-1745) 

10 20 30 

LEX ATGAC ATC T AAGAATT ATCCC GGG ACC TAC 



g i | 2 7 7 CACCTAGTGACTTATC AGGATAGTGGC ACAATGAC ATCTAAGAATTATCCCGGGACC TAC 
260 270 280 290 300 310 

40 50 60 70 80 90 

LEX CCCAATCACACTGTTTGCGAAAAGACAATTACAGTACCAAAGGGGAAAAGACTGATTCTG 

gi | 2 7 7 CCC AATCACACTGTTTGCGAAAAGACAATTACAGTACCAAAGGGGAAAAGACTGATTCTG 
320 330 340 350 360 370 

100 110 120 130 140 150 

LEX AGGTTGGGAGATTTGGATATCGAATCCCAGACCTGTGCTTCTGACTATCTTCTCTTCACC 

gi | 277 AGGTTGGGAGATTTGGATATCGAATCCCAGACCTGTGCTTCTGACTATCTTCTCTTCACC 
380 390 400 410 420 430 

160 170 180 190 200 210 

LEX AGC TC TTC AG ATC AATATGGTC C ATAC TGTGG AAGT ATG ACTGTTC CC AAAG AACTC TTG 

gi | 277 AGCTCTTCAGATCAATATGGTCCATACTGTGGAAGTATGACTGTTCCCAAAGAACTCTTG 
440 450 460 470 480 490 

220 230 240 250 260 270 

LEX ■ TTG AAC AC AAGTG AAGTAAC C GTC CGC TTTG AG AGTGGATC C C AC ATTTC TGGC C GGGGT 

gi | 277 TTG AAC AC AAGTG AAGTAAC C GTC CGC TTTG AGAGTGG ATCCC AC ATTTC TGGCC GGGGT 
500 510 520 530 540 550 

280 290 300 310 320 330 

LEX TTTTTGC TG AC C TATGC GAGC AGC G AC C ATC C AG ATTTAAT AAC ATGTTTGGAAC GAGC T 

gi | 2 7 7 TTTTTGCTG AC C TATGC GAGC AGC GAC CATC C AG ATTTAAT AAC ATGTTTGGAAC G AGC T 
560 570 580 590 600 610 

340 350 360 370 380 390 
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g i | 2 7 7 AGC C ATTATTTGAAGAC AGAATAC AGC AAATTCTGCCC AGCTGGTTGTAGAGACGTAGC A 
620 630 640 650 660 670 

400 410 420 430 440 450 

LEX GGAGACATTTCTGGGAATATGGTAGATGGATATAGAGATACCTCTTTATTGTGCAAAGCT 



gi | 2 7 7 GGAGACATTTCTGGGAATATGGTAGATGGATATAGAGATACCTCTTTATTGTGCAAAGCT 
680 690 700 710 720 730 

460 470 480. 490 500 510 

LEX GCCATCCATGCAGGAATAATTGCTGATGAACTAGGTGGCCAGATCAGTGTGCTTCAGCGC 



gi | 277 GCCATCCATGCAGGAATAATTGCTGATGAACTAGGTGGCCAGATCAGTGTGCTTCAGCGC 
740 750 760 770 780 790 

520 530 540 550 560 570 

LEX AAAGGGATCAGTCGATATGAAGGGATTCTGGCCAATGGTGTTCTTTCGAGGGATGGTTCC 



gi | 2 77 AAAGGGATCAGTCGATATGAAGGGATTCTGGCCAATGGTGTTCTTTCGAGGGATGGTTCC 
800 810 820 830 840 850 

580 590 600 610 620 630 

LEX CTGTCAGACAAGCGATTTCTGTTTACCTCCAATGGTTGCAGCAGATCCTTGAGTTTTGAA 



gi | 2 7 7 C TGTC AGAC AAGC G ATTTC TGTTT AC C TC C AATGGTTGC AGC AG ATC C TTGAGTTTT G AA 
860 870 880 890 900 910 

640 650 660 670 680 690 

LEX CCTGACGGGCAAATCAGAGCTTCTTCCTCATGGCAGTCGGTCAATGAGAGTGGAGACCAA 



gi | 277 CCTGACGGGCAAATCAGAGCTTCTTCCTCATGGCAGTCGGTCAATGAGAGTGGAGACCAA 
920 930 940 950 960 970 

700 710 720 730 740 750 

LEX GTTCACTGGTCTCCTGGCCAAGCCCGACTTCAGGACCAAGGCCCATCATGGGCTTCGGGC 



gi | 277 GTTC ACTGGTC TC C TGGCC AAGC C CGAC TTC AGG AC C AAGGCC C ATC ATGGGC TTC GGGC 
980 990 1000 1010 1020 1030 

760 770 780 790 800 810 

LEX GACAGTAGCAACAACCACAAACCACGAGAGTGGCTGGAGATCGATTTGGGGGAGAAAAAG 



gi | 2 7 7 GACAGTAGCAACAACCACAAACCACGAGAGTGGCTGGAGATCGATTTGGGGGAGAAAAAG 
1040 1050 1060 1070 1080 1090 

820 830 840 850 860 870 

LEX AAAAT AAC AGGAAT T AGG AC C AC AGG ATC T AC AC AGTC G AAC TTC AAC TTTT ATGTT AAG 



gi | 277 AAAATAAC AGG AATT AGG ACC AC AGG ATC T AC AC AGTC G AAC TTC AAC TTTT ATGTT AAG 
1100 1110 1120 1130 1140 1150 

880 890 900 910 920 930 

LEX AGTTTTGTGATGAACTTCAAAAACAATAATTCTAAGTGGAAGACCTATAAAGGAATTGTG 

gi | 277 AGTTTTGTGATGAACTTCAAAAACAATAATTCTAAGTGGAAGACCTATAAAGGAATTGTG 
1160 1170 1180 1190 1200 1210 

940 950 960 970 980 990 
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gi | 277 AATAATGAAGAAAAGGTGTTTCAGGGTAACTCTAACTTTCGGGACCCAGTGCAAAACAAT 
1220 1230 1240 1250 1260 1270 

1000 1010 1020 1030 1040 1050 

LEX TTC ATC C CTC C C ATCGTGGC C AG ATATGTGCGGGTTGTC CC C C AGAC ATGGC AC C AG AGG 



gi | 277 TTC ATC CCTCC CATC GTGGC C AGATATGTGCGGGTTGTC C CCC AG AC ATGGC AC C AG AGG 
1280 1290 1300 1310 1320 1330 

1060 1070 1080 1090 1100 1110 

LEX ATAGCCTTGAAGGTGGAGCTCATTGGTTGCCAGATTACACAAGGTAATGATTCATTGGTG 



gi | 277 ATAGCCTTGAAGGTGGAGCTCATTGGTTGCCAGATTACACAAGGTAATGATTCATTGGTG 
1340 1350 1360 1370 1380 1390 

1120 1130 1140 1150 1160 1170 

LEX TGGCGCAAGACAAGTCAAAGCACCAGTGTTTCAACTAAGAAAGAAGATGAGACAATCACA 



gi | 277 TGGCGCAAGACAAGTCAAAGCACCAGTGTTTCAACTAAGAAAGAAGATGAGACAATCACA 
1400 1410 1420 1430 1440 1450 

1180 1190 1200 1210 1220 1230 

LEX AGGCCCATCCCCTCGGAAGAAACATCCACAGGAATAAACATTACAACGGTGGCTATTCCA 

gi | 277 AGGCCCATCCCCTCGGAAGAAACATCCACAGGAATAAACATTACAACGGTGGCTATTCCA 
1460 1470 1480 1490 1500 1510 

1240 1250 1260 1270 1280 1290 

LEX TTGGTGC TC C TTGTTGTCC TGGTGTTTGC TGG AATGGGG ATC TTTGC AGC C TTTAGAAAG 



gi | 2 77 TTGGTGC TC C TTGTTGTCC TGGTGTTTGCTGGAATGGGGATC TTTGC AGC C TTTAGAAAG 
1520 1530 1540 1550 1560 1570 

1300 1310 1320 1330 1340 1350 

LEX AAG AAG AAG AAAGGAAGTCCGTATGGATCAGCGGAGGCTCAGAAAAC AGAC TGTTGG AAG 

gi | 277 AAGAAGAAGAAAGGAAGTCCGTATGGATCAGCAGAGGC TC AG AAAAC AGAC TGTTGG AAG 
1580 1590 1600 1610 1620 1630 

1360 1370 1380 1390 1400 1410 

LEX C AG ATT AAAT ATC C CTTTGC C AG AC ATC AGTCAGC TG AGTTT AC C ATC AGC T ATGAT AAT 



gi | 277 C AG ATTAAATATCCC TTTGC C AGAC ATC AGTCAGC TGAGTTTACC ATC AGCTATGAT AAT 
1640 1650 1660 1670 1680 1690 

1420 1430 1440 1450 1460 

LEX GAGAAGGAGATGACACAAAAGTTAGATCTCATCACAAGTGATATGGCAGGTTAA 



gi | 277 GAGAAGGAGATGACACAAAAGTTAGATCTCATCACAAGTGATATGGCAGGTTAACTCCGT 
1700 1710 1720 1730 1740 .1750 

gi | 277 TGACTGCCAAAATAGCATCCCCAACGTGCAGCCCTCCGCATCTATCAGCAGGTTGCCCCG 
1760 1770 1780 1790 1800 1810 

»gi | 27735142 | ref | NM_173674 . 1 | Homo sapiens" discoidin, C (2010 nt) 
rev-comp initn: 57 initl: 57 opt: 67 
61.364% identity in 44 nt overlap (492-449:1780-1823) 
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520 510 500 490 • 480 470 

LEX- ACTGATCCCTTTGCGCTGAAGCACACTGATCTGGCCACCTAGTTCATCAGCAATTATTCC 



gi | 277 GTTGAC TGC C AAAATAGC ATC C C C AAC GTGC AGC C CTC C GC ATC T ATC AGC AGGTTGC C C 
1750 1760 1770 1780 1790 1800 

460 450 440 430 420 410 

LEX- TGCATGGATGGCAGCTTTGCACAATAAAGAGGTATCTCTATATCCATCTACCATATTCCC 

gi | 277 CGGATGGATCTCAGAGATGAGGATCGGAACACCATGTTCTTTCCCACCCTAACAACAACA 
1810 1820 1830 1840 1850 1860 



14 64 residues in 1 query sequences 
2 010 residues in 1 library sequences 
Scomplib [version 3.3t05 March 30, 2000] 

start: Mon Oct 6 13:26:41 2003 done: Mon Oct 6 13:26:41 2003 
Scan time: 0.116 Display time: 0.100 

Function used was FASTA 
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